IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of 
ROB A. ' BEUKER 



Title: MOTION ESTIMATION 



Filed: CONCURRENTLY 



Serial No. 



Group Art Unit: 



Examiner: 



Atty . Docket 
PHN 17,569 




Honorable Commissioner of Patents and Trademarks 
Washington, D.C. 20231 



CITATION OF RELATED CASES 



Sir: 



Attached is a report which was made by the assignee of 
the above-identified patent application. 

The United States patent applications and issued 



patents identified in this report may be relevant to the 
examination of the above- identified patent application inasmuch 
as they have been identified by an automated search of the 
assignee's patent portfolio files as having common inventors 
with and/or subject matter which is classified by the assignee 
in the same technological field as the above-identified patent 
application. However, citation of this report is neither an 
admission that any document noted therein is prior art to the 
above- identified patent application nor a waiver of the 
confidential status of any listed patent application under 35 
U.S.C. 122. 



Respectfully submitted 




Edward W f^Goodfiafi , Reg . 28,613 
Attorney 
(914) 333-9611 



Enc. 



F : \WPDOCS\GO\MW19GOK0 . DSO . DOC 



0 , 



> » 



Related Cases /Technology Report for PHN 17569 Q 

BEtJfcER, ROB A. nc/- Q Attorney: GOODMAN * 

Docket No.: PHN 17569 „ . . „ 

Serial No. : 

Patent No. : 
OS Codes : TV 4 300 
Title : MOTION ESTIMATION. 




Common Inventors 



SHAH, IMRAN A. 
BEUKER, ROB A. 



Docket No. 
Patent No. 
OS Codes 
Title 



Attorney : GOODMAN 



Serial No. 



07/898775 



PHN 13764 
5239377 
DV1000 

DEVICE FOR SPLITTING A DIGITAL INTERLACED 
INTO COMPONENTS. 



BEUKER, ROB A. 
SHAH, IMRAN A. 



Docket No. 
Patent No. 
OS Codes 
Title 



Attorney: GOODMAN 
PHN 14152 e . - VT 

5416644 Serial No * : 08/097644 

DV1000 VR0100 

DEVICE FOR PROCESSING AN INTERLACED FRAME SIGNAL 
ON A TELEVISION DISPLAY WHEN THE SIGNAL IS 



BEUKER, ROB A. 
THEUNIS, HENDRIK G. 
HEUSDENS, RICHARD 



Docket No. 
Patent No. 
OS Codes 
Title 



PHN 15521 



Attorney: GATHMAN 
Serial No. : Q8/8 60105 



DV1000 

METHOD OF ENCODING VIDEO IMAGES. 



KALKER, ANTONIUS A. 
BEUKER, ROB A. 
THEUNIS, HENDRIK G. 



Docket No. 
Patent No. 
OS Codes 
Title 



Attorney : GATHMAN 
PHN 15889 e . . „ 

Serial No.: 08/891605 
5999655 uu/osiouo 

DV1000 

TRANSMISSION AND RECEPTION OF ENCODED VIDEO 



Related Technology 



BOLAND, LIZA 
JANSSEN, JOHAN 



Docket No. 
Patent No. 
OS Codes 
Title 



US 000009P 



Attorney : GATHMAN 
Serial No. : 



TV 4300 

VIDEO INTERPOLATION AND DECIMATION USING A 
VARIABLE DELAY FILTER, AND COMBINED WITH A 



HE, HA I Y AN 

HENTSCHEL, CHRISTIAN 



Docket No. 
Patent No. 
OS Codes 
Title 



US 000057 



Attorney : GATHMAN 
Serial No. : Q9/519548 



TV 4300 

SUBJECTIVE NOISE MEASUREMENT ON ACTIVE VIDEO 



mi 



Europaisches 
Patentamt 



European 
Patent Office 



Office europeen 
des brevets 



US ' 



Bescheinigung Certificate Attestation 



Die angehefteten Unter la- 
gen stimmen mit der 
ursprunglich eingereichten 
Fassung der auf dem nach- 
sten Blatt bezeichneten 
europaischen Patentanmel- 
dung uberein. 



The attached documents 
are exact copies of the 
European patent application 
described on the following 
page, as originally filed. 



Les documents fixes a 
cette attestation sont 
conformes a la version 
initialement deposee de 
la demande de brevet 
europeen sped flee a la 
page suivante. 



Patentanmeldung Nr. Patent application No. Demande de brevet n° 

99202532.0 



Der President des Europaischen Patentamts; 
Im Auftrag 

For the President of the European Patent Office 

Le President de I'Office europeen des brevets 
p.o. 




I.L.C. HATTEN-HECKMAN 



DEN HAAG,DEN 

THE HAGUE, 22/05/00 

LA HAYE,LE 



EPA/EPO/OEB Form 1014 -02.91 



Europaisches European Office europeen 

<A Patentamt Patent Office des brevets 



Blatt 2 der Bescheinigung 
Sheet 2 of the certificate 
Page 2 de I'attestation 



Anmeldung Nr.: Anmeldetag: 

Application no.: 99202532.0 Date of filing: 02/08/99 

Demande n* : Date de depot: 

Anmelder: 

Applicant(s): 

Demandeur(s): 

Kon1nkl1jke Philips Electronics N.V. 

5621 BA Eindhoven 

NETHERLANDS 



Bezeichnung der Erfindung: 
Title of the invention: 
Titre de I'invention: 

Motion estimation 



In Anspruch genommene Prioriat(en) / Priority(ies) claimed / Priorite(s) revendiquee(s) 

Staat: Tag: Aktenzeichen: 

State: Date: File no. 

Pay S: Date: Numero de depot: 



I nternationale Patentklassifi kation : 
International Patent classification: 
Classification internationale des brevets: 

H04N7/36 



Am Anmeldetag benannte Vertragstaaten: 

Contracting states designated at date of filing: AT/BE/CH/CY/DEA5K^S^I/FR/GB/GR/IE/IT/LI/LU/MC/NLyPT/SE 
Etats contractants designes lors du depot: 

Bemerkungen: 

Remarks: 

Remarques: 



EPA/EPO/OEB Form 1012 -04.98 




EPO - DG 1 



i 



02. oa 1999 



20.07.1999 



Motion estimation. 



The invention relates to a method and device for motion estimation, a motion- 



compensated picture signal processing device comprising such a motion estimation device, 
and a picture display apparatus comprising such a motion-compensated picture signal 
processing device. 
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A prior art motion estimation technique, called "3-D Recursive Search", has 



been described by Gerard de Haan and Robert Jan Schutten, "Real-time 2-3 pull-down 
elimination applying motion estimation/compensation in a programmable device", IEEE 
Transactions on Consumer Electronics, VoL 44, No. 3, August 1988, pp. 930-938. 3-D 

10 Recursive Search falls in the class of pixel- or block-recursive motion estimators. The 

algorithm is based in the following assumptions: motion does not change much in time, i.e. 
from frame to frame. The algorithm maintains a motion field and tries to update this field only 
when necessary. The motion field is usually similar for a relatively large region, i.e. for an 
object. Therefore the motion vectors in the neighborhood of a location are good candidates for 

15 the motion in that location. Video consists of a sequence of frames. Each frame is divided into 
blocks, e.g. of 16x16 pixels. A motion vector is associated with each block. The motion vector 
should hold the displacement between the block in the current frame compared to the previous 
frame. Suppose that we want to update the motion vector of block (x,y) in the current frame. 
3-D Recursive Search uses only a limited number of candidate vectors, say five, for the 

20 estimation, viz. some vectors from the previous frame, i.e. temporal vectors, some vectors 
from the current frame, i.e. spatial vectors, and an update of a spatial vector. For each 
candidate the motion estimation error is calculated. The candidate with the lowest motion 
estimation error is chosen as the best motion vector for that block. The algorithm uses the 
normal raster scan order to go through the blocks. 

25 WO-A-97/46,022 discloses a method of estimating motion vectors, in which 

motion parameters are determined for a given field of a video signal, and motion vectors for a 
subsequent field of the video signal are determined in dependence upon at least one 
predetermined motion vector (i.e. a motion vector already estimated for a spatio-temporally 
neighboring block) and at least one additional motion vector derived from the motion 



2 20.07.1999 
parameters. The motion parameters for the given field may be derived from motion vectors 
determined for the given field, e.g. by applying a two-dimensional histogram operation on the 
motion vectors determined for the given field. 

It is, inter alia, an object of the invention to provide an improved motion 
estimation technique. To this end, the invention provides a motion estimation method and 
device, a motion-compensated picture signal processing apparatus, and a picture display 
apparatus as defined in the independent claims. Advantageous embodiments are defined in the 
dependent claims. 

In a motion vector estimation method in accordance with a primary aspect of 
the present invention, a block-based motion vector estimation process is carried out to 
determine block-based motion vectors, at least a most frequently occurring block-based 
motion vector is determined, a global motion vector estimation process using at least the most 
frequently occurring block-based motion vector is carried out to obtain a global motion vector, 
and the global motion vector is applied to the block-based motion vector estimation process. 

These and other aspects of the invention will be apparent from and elucidated 
with reference to the embodiments described hereinafter. 

The drawing shows a functional block diagram of an embodiment of a motion- 
compensated picture display apparatus in accordance with the present invention. 

When using 3-D Recursive Search for global motion estimation, we have as 
main task: how to obtain one motion vector from all the motion vectors? Our approach is 
based on the following steps: 

1. The global motion vector is defined as the most used motion vector in 
appropriate blocks. We also use the second-most used motion vector. A block is appropriate if 
the motion estimation error is small enough and the block contains enough detail. 

2. Make the motion field smooth by introducing a global motion vector candidate. 

In fact, we use two motion estimators, a normal motion estimator that is using 
the global motion vector and a global motion estimator. First we will describe how we extract 
a global motion from the motion field. We will describe the properties of each motion 
estimator in the next sections. Finally, it is described how both motion fields are used to build 
the global motion estimator. 
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Why do we need two motion vectors? We use the normal motion estimator to 
track the changes. From the associated motion field we cannot obtain the correct global 
motion, but only candidate global motion vectors. We use a global motion estimator, still 
using the 3-D Recursive Search concept, for selecting the best global motion vector. We 
cannot solely use this global motion estimator because it is not capable of tracking changes. 

The global motion is extracted from the motion field in two steps: count for all 
"appropriate" blocks the number of times that a motion vector is used, and obtain from these 
counts the most and second-most used motion vectors. 

A block is "appropriate" if the motion estimation error is small enough (average 
SAD smaller than 30), and the block contains enough activity (activity larger than 50) where 
the activity per block is defined as: activity = max<y)(y(i,j)) - min(ij)(y(i,j)). We remove the 
blocks with a low activity, because the motion estimation is not reliable for blocks without 
detail. Currently, we use the Sum-of-Absolute-Difference measure (SAD) for the displacement 
error. Let y[i,j] and y prev [i,j] denote the pixel values of the current frame and previous frame, 
respectively. The Sum-of-Absolute-Difference measure is calculated by: 

SAD = ^\y{iJ)-y prtv (i + MV x ,j + MV y ) \, 

where (MV x ,MV y ) is the candidate motion vector and the summation is over the block. The 
SAD is set to "0" if it is smaller than a threshold to remove the influence of fixed pattern noise. 

We also use the second-most used motion vector to improve the robustness of 
the algorithm. We found that sometimes the algorithm will favor the zero-motion vector, 
whereas there is some camera panning. Supplying the global motion estimator with both 
vectors solves this situation. 



The 3-D Recursive Search estimator uses the following 6 candidates: 

1. The most-used global motion vector (which is also used as best global motion 
vector). 

2. The spatial vector of block (x- l,y- 1) (upper left). 

3. The spatial vector of block (x+ l,y- 1) (upper right). 

4. The temporal vector of the current block. 

5. The temporal vector of block (x,y+l) (lower). 

6. An update of the spatial vector of block (x- 1, y- 1) if x is even and of block 



(x+l,y-l) if x is odd. 
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The update is obtained as follows. The update vector is the sum of the spatial 
vector and a delta vector. The delta vector (dx, dy) is read from a list of 16 possible delta 
vectors. The list of sixteen possible delta vectors is given in the next table. 



j 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


dx 


0 


0 


0 


0 


4 


-4 


12 


-12 


0 


0 


1 


-1 


0 


0 


2 


-2 


dy 


4 


-4 


8 


-8 


0 


0 


0 


0 


1 


-1 


0 


0 


2 


-2 


0 


0 



The table shows that the maximum update per vector is 12 pixels horizontally and 8 pixels 
vertically. 

How do we select a delta vector? We simply use the next delta vector in the list 
for the next block and start with delta vector 0. Suppose that element j is used for block (x,y), 
then we use element j+1 for the next block (i.e. block (x+l,y)) and we use delta element "0" if 
j+1 equals sixteen. 

Each candidate is checked to see whether the resulting address is valid, i.e. 
points to an area within the frame. If not, the vector is clipped to the nearest valid motion 
vector. 



The next table shows the penalties for each vector type. Penalties are added to 
block matching errors (SAD) in order to favor certain candidate motion vectors over other 
candidate motion vectors in order to smoothen the motion field. 



Vector type 


Penalty (per block) 


Global motion vector 


3 


Spatial vector 


0 


Temporal vector current block 


0 


Temporal vector lower block 


8 


Update vector 


32 



The global motion estimator uses the following four candidates: 

1. Most-used motion vector obtained by the block-based 3-D Recursive Search 
estimator. 

2. Second-most used motion vector obtained by the block-based 3-D Recursive 
Search estimator. 

3. Cyclically varying updates of the motion vector mentioned at 1 (see above). 
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4. Cyclically varying updates of the motion vector mentioned at 2 (see above). 

On a block basis, the global estimator determines which of the four candidates 
is the best one. From these best candidates determined on a block basis, the most-frequently 
occurring one is retained. 



5 The penalties for each type are: 



Vector type 


Penalty (per block) 


Global motion vector 


Oor 1 


Update of global motion vector 


32 



The penalty for the global motion candidate is 1 if the motion vector is zero and 
0 otherwise. This may be simplified to "0" only without losing accuracy. 

The full global motion estimator uses the following steps per frame: 
10 1 . Get the best global motion vector from the global motion estimation. 

2. Use this vector for the 6-candidate 3-D Recursive Search motion estimation. 

3. Extract the most used and second-most used global motion vectors from the 
resulting motion field. 

4. Use these motion vectors in the global motion estimation, i.e. in the four- 
15 candidate motion estimation. 

5. The global motion is extracted from the resulting motion field and used in step 
1. 

To reduce the CPU load of the algorithm, the number of motion estimation 
20 blocks is reduced by sub-sampling. Since we only require one motion vector per frame, the 
global motion vector, we do not need to calculate a motion vector for each block, so that the 
number of used blocks can be sub-sampled. We currently use a sub-sampling factor of two 
horizontally and two vertically. Note that we may be able to use a factor of four for the global 
motion estimation, if necessary. The sub-sampling factor is limited for the following reasons: 
25 A too high sub-sampling factor reduces the probability that there are "appropriate candidates" 
(blocks with a small motion estimation error and a sufficiently high activity). Moreover, using 
too few blocks will reduce the smoothness of the motion field. In addition, it is possible to 
apply sub-sampling within a block to reduce the number of pixels. 

Using processor-specific features, such as MMX, also helps in speeding up the 
30 computation. Also, the time spend in the SAD calculation can in principle be reduced by using 
cross correlation. 
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To improve the global motion estimation algorithm, the following measures are 
possible. Retain not only the most used and second-most used global motion vectors, but also 
less frequently used motion vectors. Use only the central part of the current frame for motion 
estimation, e.g. a quarter of the frame. If there is some rotation (with the middle of the frame 
5 as center of rotation), the blocks in the outer area of the frame will contain more displacement 
than the central part Note that this latter measure will also reduce the computational load. 



The drawing shows a functional block diagram of an embodiment of a motion- 
compensated picture display apparatus in accordance with the present invention. A picture 

10 signal is applied to a block-based motion vector estimator BME and to a global motion-vector 
estimator GME that operate as set out above. The block-based motion vector estimator BME 
applies a most frequently used motion vector MFMV and a second-most frequently used 
motion vector SMFMV to the global motion- vector estimator GME. The global motion- vector 
estimator GME applies a global motion vector GMV to the block-based motion vector 

15 estimator BME. The picture signal is also applied to a motion-compensated processor MCP 
for carrying out, e.g. a motion-compensated interpolation (say, a 100 Hz conversion) or a 
motion-compensated stitching of images obtained by a scanner or video camera. The motion- 
compensated processor is controlled by either block-based motion vectors supplied by the 
block-based motion vector estimator BME or global motion vectors supplied by the global 

20 motion estimator GME. A switch S symbolically indicates this choice. In practice, depending 
on the application, there is no switch S and the appropriate type of motion vectors is used. 
Global vectors will e.g. be used for stitching scanned images, while block-based vectors will 
e.g. be used for 100 Hz conversion. The output of the motion-compensated processor MCP is 
applied to a display device DD. In other applications of the invention, such as in a scanner, the 

25 output of the motion-compensated processor MCP will be printed on paper. 



It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. Where in the above- 
30 described examples only a most-used and a second-most used vector are used, it is an obvious 
generalization clearly falling within the scope of the claims to use the N most-used vectors. In 
the claims, any reference signs placed between parentheses shall not be construed as limiting 
the claim. The word "comprising" does not exclude the presence of elements or steps other 
than those listed in a claim. The word "a" or "an" preceding an element does not exclude the 
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presence of a plurality of such elements. The invention can be implemented by means of 
hardware comprising several distinct elements, and by means of a suitably programmed 
computer. In the device claim enumerating several means, several of these means can be 
embodied by one and the same item of hardware. 



CLAIMS: 
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1. A motion vector estimation method, comprising the steps of: 
carrying out a block-based motion vector estimation process (BME) to 

determine block-based motion vectors; 

determining at least a most frequently occurring block-based motion vector 

5 (MFMV); 

carrying out a global motion vector estimation process (GME) using at least the 
most frequently occurring block-based motion vector (MFMV) to obtain a global motion 
vector (GMV); and 

applying the global motion vector (GMV) to the block-based motion vector 
10 estimation process (BME). 

2. A method as claimed in claim 1, wherein the determining step includes making 
a selection among block-based motion vectors having a corresponding motion error below a 
given motion error threshold. 

15 

3. A method as claimed in claim 1, wherein the determining step includes making 
a selection among block-based motion vectors estimated for blocks having a difference 
between maximum and minimum pixel values above a given activity threshold. 

20 4. A method as claimed in claim 1, wherein both the most frequently occurring 

block-based motion vector (MFMV) and a second-most frequently occurring block-based 
motion vector (SMFMV) are determined and used in the global motion vector estimation 
process (GME). 

25 5. A method as claimed in claim 1, wherein said global motion vector estimation 

process includes the steps of: 

comparing, on a block basis, a plurality of candidate vectors including the most 
frequently occurring block-based motion vector (MFMV) to obtain best vectors determined 
per block; 
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outputting a most-frequently occurring best vector determined per block as the 
global motion vector (GMV). 

* » 

6. A motion vector estimation device, comprising: 

5 block-based motion vector estimation means (BME) for determining block- 

based motion vectors; 

means for determining at least a most frequently occurring block-based motion 
vector (MFMV, SMFMV); 

means (GME) for carrying out a global motion vector estimation process using 
10 at least the most frequently occurring block-based motion vector (MFMV, SMFMV) to obtain 
a global motion vector, and 

means for applying the global motion vector (GMV) to the block-based motion 
vector estimation means (BME). 



15 7. A motion-compensated picture signal processing apparatus, comprising: 

a motion vector estimation device as claimed in claim 6 for generating motion 

vectors; and 

a motion-compensated processor (MCP) for processing a picture signal in 
dependence on the motion vectors. 

20 

8. A picture display apparatus, comprising: 

a motion-compensated picture signal processing apparatus as claimed in claim 7 
to obtain a processed picture signal; and 

a display device for displaying the processed picture signal. 
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ABSTRACT: 
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In a motion vector estimation method, a block-based motion vector estimation 
process (BME) is carried out to determine block-based motion vectors, at least a most 
frequently occurring block-based motion vector (MFMV) is determined, a global motion 
vector estimation process (GME) using at least the most frequently occurring block-based 
5 motion vector (MFMV) is carried out to obtain a global motion vector (GMV), and the global 
motion vector (GMV) is applied to the block-based motion vector estimation process (BME). 



(Fig. 1) 
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